Amending  Moore’s  Law  for  Embedded 

Applications 


Agenda 


29  Sep  2004 


Sensema  k  i  ng/App  rec  i  ation 


Ground  Truth 


Knowledge 


Information 


Situation 


Declsion-m  aking 


Info 
Portrayal 


Trackin 


Detect 


Broadening  &  Climbing  the 
Cognitive  Pyramid  of  Computation 


Dr.  Richard  W.  Linderman 

Senior  Scientist,  Advanced 
Computing  Architectures 

AFRL  Information  Directorate 


1 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

01  FEB  2005 

2.  REPORT  TYPE 

N/A 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

Amending  Moores  Law  for  Embedded  Applications 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS (ES) 

AFRL  Information  Directorate 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS  (ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release,  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

See  also  ADM001742,  HPEC-7  Volume  1,  Proceedings  of  the  Eighth  Annual  High  Performance  Embedded 
Computing  (HPEC)  Workshops,  28-30  September  2004. ,  The  original  document  contains  color  images. 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

uu 

18.  NUMBER 
OF  PAGES 

12 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


Contribution  of  Moore’s  Law  to 
Improvements  of  Embedded  Systems 


*  Price/Performance:  Gigaflops/$M  affordability 

*  Memory  Capacity:  programming  simplifications 

*  Steep  memory  hierarchy:  programming  inefficiencies 
and  complexities 

*  New  flexibilities:  e.g.  reconfigurable  hardware 

*  New  complexities:  software  and  parallelism 

*  Dramatic  new  system  capabilities 


Extending  the  Affordability  Trend 


Exponentially  Improving  HPC  Affordability 
Transitioned  to  AF  Users 
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CHALLENGE: 

Develop  and  Incorporate  the  Most 

Affordable  Embedded  Information 

Technology  Available 

APPROACH: 

•  Leverage  Commercial 
Investments  In  Computer 
Architectures 

•  Develop  Portable  Embedded  DoD 
Applications  using  middleware 
standards 

•  Leverage  DARPA,  and  Other  DoD 
Efforts  In  Emerging  Architectures 


Projection: 

2007:  60  trillion  flops/$M 
2010:  360  trillion  flops/$M 
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Airborne  COTS  HPC 


25  Dual-Xeon  Compute  Nodes 

*  Dual  GbE  Network  (80  MB/s  per  link) 

•  1U  Rack-mounted 


•  $87K  approximate  cost  -  3.1  DP  TFLOPS/$1  M 


Commodity  Compute  Node: 

*  Dual  2.66  GHz  Intel  Pentium  4  Xeon  CPU 

—  512KB  L2  Cache 

—  21.3  SP  GFLOPS  peak  per  node 

*  Dual  GbE  network  interfaces 

*  6  GB  DDR  SDRAM 


a  _  l  SBtrlC 


Peak  Performance: 


532.5  GFLOPS  (IEEE  32  bit) 


Swathbuckler  Cluster 
(Coyote) 


5 


Contribution  of  Moore’s  Law  to 
Improvements  of  Embedded  Systems 


*  Price/Performance:  Gigaflops/$M  affordability 

*  Memory  Capacity:  programming  simplifications 

*  Steep  memory  hierarchy:  programming  inefficiencies 
and  complexities 

*  New  flexibilities:  e.g.  reconfigurable  hardware 

*  New  complexities:  software  and  parallelism 

*  Dramatic  new  system  capabilities 


6 


Can  we  preserve  historical 
improvement  rates? 


Azimuth  Processing 
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Heterogeneous  HPC  Hardware 


48  Nodes  in  2  cabinets 
Server  product  leverage 

Each  node  with:  Dual  2.2  GHz+  Processors 

—  4  Gbyte  SDRAM 

—  Myrinet  320  MB/sec  Interconnect 

—  80  GB  disk 

—  12  M  gate  Adaptive  Computing  Board 

34  TOPs  demonstrated 

Online  FEB  2003  supporting  HIE,  TTCP  and  SBR 
projects 


Heterogeneous  High 
Performance  Computer 
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Software  Complexity  &  Cost 
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Number  of  Function  Points 


Total  Onboard  Computer  Capacity  (OFF) 


•  Traditional  Embedded  SW  Development 

—  Low  Level  Programming 
—  Ad  Hoc  Approaches 
—  Stand-alone,  Static  Implementations 
—  Custom  Systems 
—  Little  Code  Re-use 
—  V  &  V  is  labor  intensive  (e.g.  f/a-22) 

•  Result  is.... 

—  Prolonged  Design  Schedules 
—  Excessive  Cost 

—  Difficulty  in  Maintenance/  Upgrade/  Retrofit 
—  Limits  on  Functionality 

•  Looking  Ahead 

—  Increased  Functionality  =>  Complexity 
—  Move  Towards  Networking,  Interconnecting  Systems 
—  Distributed  Computation  Models 
—  Reconfigurability,  Dynamic  Modifications 
—  Code  Correctness/  Safety  Concerns 
—  Security 

—  Shortage  of  Skilled  Programmers 


Advanced  tools  and  techniques  are  need 
to  address  increasing  complexity! 
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Can  we  preserve  historical 
improvement  rates? 


i. 


Of  necessity,  innovate  in  the  broader  architecture  design 
space 

slowing  VLSI  motivates  more  architectural  exploration 
Invest  hardware  to  simplify  software 

Encapsulate  sophisticated  software  to  achieve  performance 
with  portability 
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